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Abstract.  We  address  methods  of  speeding  up  the  calculation  of  the 
well-founded  semantics  for  normal  propositional  logic  programs.  We  first 
consider  two  algorithms  already  reported  in  the  literature  and  show  that 
these,  plus  a  variation  upon  them,  have  much  improved  worst-case  be¬ 
havior  for  special  cases  of  input.  Then  we  propose  a  general  algorithm 
to  speed  up  the  calculation  for  logic  programs  with  at  most  two  positive 
subgoals  per  clause,  intended  to  improve  the  worst  case  performance  of 
the  computation.  For  a  logic  program  V  in  atoms  A,  the  speed  up  over  the 
straight  Van  Gelder  alternating  fixed  point  algorithm  (assuming  worst- 
case  behavior  for  both  algorithms)  is  approximately  .  For 

I'PI  >  |A|^,  the  algorithm  runs  in  time  linear  in  \V\. 

1  Introduction 

Logic  programming  researchers  have,  over  the  last  several  years,  proposed  many 
logic-based  declarative  semantics  for  various  sorts  of  logic  programming.  The 
hope  is  that,  if  they  can  be  efficiently  implemented,  these  semantics  can  restore 
the  separation  between  logic  and  implementation  that  motivated  the  developers 
of  Prolog  but  that  Prolog  did  not  achieve.  In  the  last  few  years,  several  projects 
have  begun  to  produce  working  implementations  of  some  of  these  semantics. 

A  major  difficulty  with  all  such  approaches  is  the  complexity  of  the  calcu¬ 
lations,  and  various  approaches  have  been  tried  to  speed  the  calculations  up. 
We  feel  that  serious  investigation  of  new  algorithms  to  compute  these  semantics 
may  have  significant  practical  importance  at  this  time. 

Two  quite  popular  semantics  for  the  class  of  normal  logic  program  are  the 
stable  [4]  and  the  well-founded  [14].  For  propositional  logic  programs,  computing 
inferences  under  the  stable  semantics  is  known  to  be  co-NP-complete  [7];  com¬ 
puting  inferences  under  the  well-founded  is  known  to  be  quadratic  time  (folk¬ 
lore),  and  no  faster  algorithm  is  known.  Work  in  [8]  (on  questions  of  updating 
accessibility  relations  in  graphs)  suggests  that  it  may  be  quite  difficult,  using 
standard  techniques,  to  break  the  quadratic-time  bound  on  the  well-founded 
semantics.  We  are  concerned  here  with  speeding  up  the  computation  of  the 
well-founded  semantics;  in  particular,  we  find  a  large  class  of  propositional  logic 
programs  for  which  we  can  break  the  quadratic  time  bound.  As  an  additional 
application,  we  also  note  that,  if  the  calculation  of  the  well-founded  semantics 
can  be  made  sufficiently  fast,  that  calculation  may  also  prove  a  highly  useful 
subroutine  in  computing  under  the  stable  semantics. 

The  standard  calculation  methods  for  the  well-founded  semantics  are  based 
upon  the  alternating  fixed-point  algorithm  of  Van  Gelder  [13].  Several  researchers 
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a  propositional  logic  program  is  generally  not  particularly  efficient,  since  unifor¬ 
mities  over  the  variables  are  lost.  Accordingly,  the  utility  of  the  methods  we  use 
here  will  be  determined  in  part  by  whether  they  can  be  incorporated  in  faster 
methods  for  handling  first  order  logic  programs.  This  we  intend  to  pursue  in 
future  research. 

2  Terminology  and  Notation 

A  normal  propositional  logic  program  V  over  a  set  of  atoms  A  consists  of  a  finite 
set  of  rules  of  the  form 

a  ^  /?i  A  /?2  A  •  •  •  A  /?i , 

where  each  Pi  is  a  literal,  i.e.,  a  proposition  letter  in  A  or  its  negation.  We  shall 
use  these  meanings  of  V  and  A  henceforth  in  the  paper. 

Definition!.  1.  Traditionally  associated  with  a  logic  program  “P  is  a  set  of 
binary  dependency  relations  on  the  atoms  of  the  programs.  For  proposition 
letters  a,  6: 

-  b<posa  {a  depends  directly  positively  upon  6)  if  there  is  a  rule  in  V  with 
head  a  and  subgoal  6. 

-  b<neg^  depends  directly  negatively  upon  6)  if  there  is  a  rule  in  V  with 
head  a  and  subgoal  -i6. 

2.  Two  further  dependency  relations  are  defined  from  the  above  relations,  using 
the  obvious  regular-set  type  notation,  with  sequence  meaning  concatenation: 

-  U  <neg^  Relation  <dep  is  called  the  direct  dependency  rela¬ 
tion. 

-  <ineg  —  <neg<pos* •  Relation  <ineg  is  the  relation  of  dependency  through 
exactly  one  negation  (and  any  number  of  positives). 

3.  A  logic  program  V  is  stratified  if  its  relation  <ineg  is  acyclic  (i.e,  if  its 
transitive  closure  is  irreflexive) . 

4.  A  logic  program  V  is  positive- a  cyclic  if  its  relation  <^^5  is  acyclic. 

For  both  stratified  and  positive-acyclic  logic  programs,  it  turns  out  that  the 
well-founded  semantics  can  be  found  in  time  linear  in  the  size  of  the  logic  pro¬ 
gram. 

In  this  paper  we  make  some  fairly  standard  assumptions  about  how  complex¬ 
ity  is  measured.  We  assume  that  every  propositional  logic  program  has  proposi¬ 
tion  letters  xi, . . . ,  for  some  n.  We  assume  that  computations  are  performed 
on  a  machine  with  random-access  memory,  and,  in  particular,  that  accessing 
array  positions,  following  pointers,  and  copying  pointers,  can  done  in  constant 
time.  (Similar  assumptions  were  made,  for  example,  in  [2]).  Finally,  we  assume 
that  certain  arithmetic  operations  (initialization  to  0,  adding,  dividing,  copying, 
incrementation,  decrementation,  and  comparison)  on  natural  numbers  used  for 
indices  and  reference  counts  can  be  done  in  unit  time. 


3  A  Hypergraph  Representation  of  Programs 

To  picture  algorithms  to  compute  the  well-founded  semantics,  we  frequently 
think  of  logic  programs  as  representing  directed  hypergraphs  in  a  fairly  straight- 
forward  way: 

Definition 2.  Let  P  by  a  logic  program  over  atoms  set  A,  With  only  trivial 
rewriting,  we  may  eissume  there  is  exactly  one  rule  s  with  no  positive  subgoals, 
and  for  that  s,  -15  does  not  appear  in  V.  (If  necessary,  add  such  a  proposition 
letter  to  a.  If  any  other  rule  has  no  positive  subgoals,  add  in  s  as  a  positive 
subgoal.)  We  shall  call  5  the  sonrce.  We  shall  ignore  rule  s  in  building  the 
hypergraph,  instead  treating  the  atom  s  specially. 

Set  A  is  the  set  of  vertices  of  the  hypergraph.  First  group  together  rules  that 
have  the  same  heads  and  positive  subgoals:  group  rules 

a  ^  61  A  •  •  •  A  A  -ic\  A  •  •  •  A 

a  ^  61  A  •  •  •  A  A  A  •  ■  *  A 

a  ^  61  A  •  •  •  A  A  -icj  A  •  •  •  A  “>0^^ 

into  “rules”  the  form 

a  <-  61  A  •  •  •  A  6i  A  ((-.cj  A  •  •  •  --cjj  V  (-icf  A - >c]J  V  •  •  •  V  (->cj  A - ■cj'J) 

The  head  and  positive  subgoals  of  the  “rule”  will  be  considered  to  be  a 
directed  hypergraph  edge  (a,  {61, . . . ,  directed  from  the  subgoals  to  the  head. 
The  remaining  conjunct, 

((-iCi  A  •  •  ’  A  --cjj  V  (“^Ci  A  •  -  A  -«c|J  V  •  •  •  V  (“-Ci  A  •  •  •  A 

will  be  called  the  presupposition  of  the  edge.  (Intuitively,  the  rule  is  applied 
only  if  the  presupposition  is  either  known  to  be  true  or  not  known  to  be  false, 
depending  upon  circumstances.)  Variable  a  will  also  be  called  the  head  of  the 
edge,  and  61, . . .,  62  will  be  called  the  tails  of  the  edge. 

The  degree  d{a)  of  any  a  G  -4  is  the  sum  of  the  numbers  of  tails  on  all 
hyperedges  with  head  a. 

We  shall  use  both  hypergraph-theoretic  language  and  logic-program  language 
in  this  paper,  whichever  seems  more  obvious  at  any  point. 

Definitions.  Given  a  partial  function  t:A^{T,F}  we  can  easily  assign  an 
interpretation  r{p)  for  p  the  presupposition  of  any  rule: 

r(((-'Ci  A  •  •  •  A  -'cjj  V  (--Cl  A  •  •  •  A  V  *  •  •  V  (--Ci  A  •  •  •  A 

is  true  if  every  conjunct  of  some  disjunct  is  true  in  r,  false  if  some  conjunct 
of  every  disjunct  is  false  in  r,  and  undefined  otherwise.  Moreover,  this  can  be 
computed  by  substituting  T  literals  true  in  r  and  F  for  literals  false  in  r  and 
doing  standard  simplifications. 


have  observed  that,  for  propositional  logic  programs,  the  (worst  case)  time  taken 
by  that  algorithm  to  compute  the  well-founded  partial  model  is  quadratic  in  the 
size  of  the  program.  More  specifically,  for  a  propositional  logic  program  V  with 
atom  set  the  complexity  is  O(l^ll'Pl):  Van  Gelder’s  algorithm  can  make  at 
most  1^1  passes  before  reaching  a  fixed  point  (plus  another  to  verify  that  a  fixed 
point  has  been  reached),  and  each  pass  consists  of  finding  the  least  models  of  two 
Horn  clause  programs  derived  (essentially  via  the  Gelfond-Lifschitz  Transform) 
from  V.  Since  finding  the  least  model  of  a  Horn  clause  program  is  linear  in  the 
size  of  the  program  [2,  6,  10],  this  gives  the  stated  complexity. 

Some  techniques  are  know  for  speeding  up  the  calculation  on  “nice”  pro¬ 
grams.  We  investigate  two  of  them  and  prove  that  they  do  achieve  optimal 
speed-up  on  certain  classes  of  programs.  We  then  turn  to  improving  the  worst- 
case  behavior.  Our  main  result  is  Theorem  10:  We  show  that  our  Algorithm  6 
computes  the  well-founded  partial  model  of  any  program  V  in  proposition  let¬ 
ters  A  in  time  0(17^1  -h  \A\^  +  1^|^),  so  long  as  V  has  at  most  two  positive 

subgoals  per  rule.  Thus  when  V  is  sufficiently  larger  than  A^  this  new  algorithm 
will  thus  have  a  noticably  better  worst-case  time  than  Van  Gelder’s  original 
construction.  Thus  also,  |>l|^  G  0(|7^1),  our  algorithm  takes  time  linear  in  \V\, 
which  is  clearly  optimal. 


Restrictions 

We  shall  be  discussing  only  (finite)  propositional  logic  programs.  Now  for  many 
practical  purposes,  propositional  logic  programs  are  not  especially  interesting 
by  themselves,  but  semantics  for  first  order  logic  programs  (a.k.a.  intensional 
databases,  or  IDE’s)  over  (finite)  extensional  databases  (EDB’s)  are  generally  de¬ 
fined  by  translating,  via  Herbrand  expansions,  these  logic  programs  and  databases 
into  (finite)  propositional  logic  programs. 

In  our  principal  algorithm,  Algorithm  6,  we  shall  consider  only  logic  pro¬ 
grams  with  at  most  two  positive  subgoals  per  rule.  In  some  circumstances,  the 
restriction  to  two  positive  subgoals  per  rule  is  no  particular  hindrance;  it  is  well 
known  that,  given  any  logic  program  V  in  proposition  letters  A^  one  can  easily 
construct  a  logic  program  in  proposition  letters  A"^  D  which  is  a  conser¬ 
vative  extension  of  V  under  many  logic  programming  semantics,  including  the 
well-founded,  the  stable,  and  Fitting’s  Kripke-Kleene  (3-valued  program  comple¬ 
tion)  semantics  [3].  However,  in  general  A'^  will  be  substantially  larger  than  A, 
making  1^”^ lip'll  substantially  larger  than  l-dH'Pl  —  and  thus  possibly  negating 
all  benefits  of  our  algorithm. 

Nevertheless,  we  feel  Algorithm  6  may  prove  fairly  widely  useful.  For  exam- 
ple,  there  are  very  natural  conditions  on  IDE’s  that  guarantee  that  cyd^/jP!)  < 
1  for  all  large  enough  EDE’s,  and  for  which  the  standard  reduction  to  a  program 
with  two  positive  subgoals  per  rule,  done  in  the  IDE  and  thus  automatically 
extended  to  the  propositional  program,  leaves  \A'^\  G  0(1^|). 

Of  course,  in  actual  practice,  computing  the  well-founded  semantics  for  a 
particular  IDE  over  various  EDE’s  by  first  translating  each  IDE/EDE  pair  into 


The  algorithm  embodied  in  Van  Gelder’s  alternating  fixed  point  definition  of 
the  well-founded  semantics  [13]  translates  almost  immediately  into  the  following 
algorithm  on  their  corresponding  hypergraphs: 

Algorithm  1.  (Van  Gelder)  We  are  given  a  program  V  and  its  hypergraph  in¬ 
terpretation  H.  We  maintain  a  partial  truth  assignment  r,  consisting  of  the  truth 
values  already  inferred,  and  two  approximations  to  ,  an  over- approximation  H 
and  an  under- approximation  li.  Initially,  r  =  0  and  and  %  is  l^he  set  of 

hyperedges  of  H  with  empty  presuppositions.  We  maintain  two  loop  invariant 
H  is  the  set  of  hyperedges  of  li  with  presuppositions  p  where  r(p)  =  T,  and  H 
is  the  set  of  hyperedges  of  H  with  presuppositions  p  where  r(p)  F. 

Repeat  until  no  changes  are  made  during  an  entire  pass: 

1.  Van  Gelder  edge  deletion  step: 

Find  all  vertices  a  £  A  accessible  from  s  in  hypergraph  li. 

For  every  vertex  a  accessible  in  H 
set  r(a)  =  T. 

For  every  hyperedge  e  olH  —  %  with  presupposition  p  containing  -la, 
if  r(p)  =  F 

remove  h  from  H. 

2.  Van  Gelder  edge  addition  step: 

Find  all  vertices  a  £  A  accessible  from  s  in  hypergraph  H. 

For  every  vertex  a  not  accessible  in  H 
set  r(a)  —  F. 

For  every  hyperedge  e  oiH  —  li  with  presupposition  p  containing  “ia, 
if  r(p)  =  T 
add  h  to  li. 

The  well-founded  partial  model  of  V  is  the  final  partial  interpretation  r. 

4  Elaborating  Upon  Known  Speed-Ups 

“Best-case”  and  informal  “Average-case”  speedup  techniques  for  the  well-founded 
semantics  have  been  fairly  widely  observed.^  The  algorithmic  methods  of  this 
section  are  already  known;  what  is  new  here  is  (1)  a  new  algorithm,  which  explic¬ 
itly  combines  two  older  ideas,  and  (2)  some  propositions  about  these  algorithms. 

The  first  speedup  we  first  heard  from  Subrahmanian  [11].  Fitting’s  Kripke- 
Kleene  semantics  [3]  (a.k.a.  3-valued  program  completion  semantics)  for  logic 
programs  can  be  calculated  much  as  is  the  well-founded  semantics,  but  with 
Van  Gelder’s  search  for  inaccessible  nodes  of  H  replaced  by  searches  for  nodes 
(other  than  the  source  s)  with  in-degree  0.  (Essentially  this  algorithm  is  in  [3].) 


^  We  say  “informal  Average-case”  because  there  is  no  accepted  distribution  on  which 
to  base  the  average  and  because,  so  far  as  we  know,  the  conclusions  are  based  only 
upon  experimentation. 


Proposition  4.  (folklore)  The  above-sketched  computation  of  the  Kripke-Kleene 
semantics  for  a  propositional  logic  program  V  can  be  done  in  Ol'Pj  time. 

Idea  of  Proof  The  proof  depends  upon  the  fact  that  finding  whether  a  node  is 
inaccessible  from  s  takes  a  search,  while  a  node  with  in-degree  0  can  be  identified 
in  constant  time  with  the  right  data  structure.  I 

Essentially  the  same  proof  gives  that 

Propositions.  Suppose  an  algorithm  is  of  the  following  form,  and  a  logic  pro¬ 
gram  V  is  given: 

Set  r  =  0 

Repeat  until  a  fixed  point  is  reached 
Use  Fitting  ^s  algorithm  to  extend  r 

Set  r{a)  =  F  for  some  proposition  letters  a  previously  not  in  the  domain(T ). 
Then  the  total  time  taken  by  all  iterations  of  the  Fitting  algorithm  is  O(l'Pl).^ 
Algorithm 2.  (Subrahmanian  et.  al.) 

Initialize  r  =  0,  =  W ,  and  Ti  =  the  hyperedges  of  H  with  empty 

presuppositions. 

Repeat  until  no  edges  are  deleted  in  step  2: 

1.  Fitting  Algorithm: 

Repeat  until  no  changes  are  made  during  an  entire  pass: 

For  every  vertex  a  where  r(a)  is  undefined  and  for  some  hyperedge 
h  of  TL  6  of  /i,  r(6)  =  T, 

set  r(a)  =  T.  _ 

For  every  vertex  a  where  T(a)  is  undefined  and  d(a)  =  0  in 
set  r(a)  =  F. 

For  every  hyperedge  e  oili  with  presupposition  p, 

if  r{p)  —  F  _ 

remove  h  from  7i. 

For  every  hyperedge  e  oiH-Ji  with  presupposition  p, 
if  r(p)  =  T 
add  h  to  Ti. 

2.  Make  one  pass  of  Van  Gelder  edge  addition  step 


The  well-founded  partial  model  of  V  is  the  final  partial  interpretation  r . 

Subrahmanian  noted  that  Algorithm  2  is  faster  than  Algorithm  1  on  most  of 
the  logic  programs  his  group  randomly  generated.  We  find  a  partial  explanation 
for  this  in  the  following  proposition: 

^  We  have  sloughed  over  an  important  detail  here  since  it  does  not  affect  this  paper: 
we  may  have  to  use  a  four- valued  logic,  with  the  four  truth  values  T,  F,  undefined, 
and  contradictory. 


Proposition  6.  Lei  V  be  a  logic  program. 

1.  IfV  is  positive-acyclic,  then  the  Kripke-Kleene  semantics  forV  agrees  with 
the  well-founded  semantics,  and  hence  Algorithm  2  finds  the  well-founded 
semantics  in  time  0(|7^|). 

2.  IfV  consists  of  a  positive- acyclic  program  plus  some  additional  rules  giving 
only  k  hyperedges  (possibly  with  very  long  presuppositions),  then  Algorithm  2 
finds  the  well-founded  semantics  in  time  0(^;17^|). 

Proof  1.  In  [9]  we  showed  that,  for  a  logic  program  with  no  positive  subgoals, 
the  Kripke-Kleene  and  well-founded  semantics  agree.  It  is  routine  to  extend 
that  proof  to  all  positive-acyclic  logic  programs. 

2.  The  difference  between  the  Kripke-Kleene  semantics  and  the  well-founded 
semantics  is  that  the  well-founded  semantics  identifies  positive  dependency 
cycles.  Since  the  first  part  of  each  pass  through  the  outside  loop  of  the  algo¬ 
rithm  is  Fitting’s  algorithm,  and  since  A  is  finite,  each  atom  found  by  step  2 
must  be  in  a  positive  cycle,  all  of  whose  atoms  are  inaccessible.  Thus  one  of 
the  k  special  edges  (the  “back-edges”)  must  be  in  this  cycle.  Accordingly,  at 
least  one  back-edge  will  be  added  to  Ti  in  each  pass.  Thus,  after  k  passes,  no 
additional  cycles  in  H  remain  to  be  discovered,  and  all  additional  inferences 
may  be  made  by  the  Fitting  semantics.  I 

A  second  speed-up  is  motivated  by  the  fact  that  calculating  the  perfect  model 
of  a  stratified  logic  program  (which  is  also  the  well-founded  partial  model  and  the 
unique  stable  model)  can  be  done  in  linear  time.  Consider  {A,  <dep)  as  a  directed 
graph.  In  linear  time,  form  the  strongly  connected  components  of  the  graph  as  in 
[12],  and  topologically  sort  the  components.  Let  Vi  be  the  set  of  rules  of  V  whose 
heads  occur  in  the  first  component  of  the  graph.  Vi  turns  out  to  be  pure  Horn, 
so  its  perfect  model  is  its  minimal  model  Mi,  which  can  be  computed  in  time 
linear  in  I'Pi]  [2,  6,  10].  Let  V2  be  the  set  of  rules  of  V  whose  heads  occur  in  the 
second  component.  Substitute  into  V2  the  truth  values  computed  for  variables  in 
the  first  component  and  simplify.  The  result  is  a  pure  Horn  program,  so  compute 
its  minimal  model,  in  time  linear  in  17^21)  and  the  perfect  model  of  Vi  U  V2  is 
Ml  U  M2.  Continue  this  way  through  all  the  strongly  connected  components. 
The  result  is  the  perfect  model,  which  has  been  constructed  in  time  linear  in 
1^1- 

Now  for  the  well-founded  and  Kripke-Kleene  semantics,  the  same  partition¬ 
ing  into  strongly-connected  components,  decomposition  of  the  program  by  the 
components  and  construction  of  the  semantics  for  each  piece  separately  cor¬ 
rectly  constructs  the  semantics.  We  shall  call  the  levels  7^i,'p2)-*  -  strata,  or 
modules.  (These  were  investigated  in  the  current  context  in  [9]  and  [1].)  Un¬ 
like  with  stratified  programs,  there  is  no  guarantee  that  the  strata  are  Horn, 
However,  for  programs  with  more  than  one  strongly  connected  component,  this 
gives  a  divide-and-conquer  approach  which  can  sometimes,  as  with  stratified 
programs,  reduce  the  complexity  of  finding  the  well-founded  model.  (The  same 
technique  has  been  used  [5]  to  speed  up  the  search  for  individual  stable  models. 


though  the  technique  does  not  work  directly  for  computing  the  intersection  of 
stable  models  [9].) 

We  can  combine  this  technique  with  Algorithm  2,  giving  what  appears  to 
be  a  fairly  useful  algorithm.  We  have  chosen  to  start  first  with  the  Fitting  step 
since,  for  example,  calculations  from  non-stratified  IDE’s  over  nice  EDB’s  can 
reduce  to  stratified  programs  after  only  one  or  two  passes  of  the  Fitting  method, 
e.g.,  for  modularly  stratified  programs. 

In  the  following  algorithm,  we  refer  to  the  direct  dependency  relation  <dep 
on  H.  This  is  the  analogue  of  the  logic  programming  definition:  a<depb  if  there  is 
a  hyperedge  in  H  with  head  a  and  with  either  b  in  its  tail  or  appearing  in  the 
presupposition  for  the  hyperedge.  Since  the  algorithm  is  merely  a  combination 
of  previous  pieces,  we  summarize  the  algorithm.  The  algorithm  is  initially  called 
with  r  =  0  and  ^  and  ?i  as  in  Algorithms  1  and  2.  The  details  of  partitioning 
into  strata  are  can  be  filled  in  easily  by  the  reader. 

Algorithms. 

Initialize  r  =  0.  _ 

Perform  the  Fitting  step,  updating  r  and  all  current  subgraphs  of  Tf  and  Ti. 
Decompose  A  into  its  strongly  connected  components  Ai^..  .^Aj  under  <dep 
(defined  from  7f). 

Topologically  sort  the  components,  .42, . .  • ,  4n- 
Construct  the  subprograms  'Pi,'P2,  •  • 

For  2  =  1  to  n:  _ 

Construct  the  subhypergraphs  ?£*  and  Hi . 

Perform  one  Van  Gelder  edge  addition  step  on  'P*, 

updating  r  and  all  current  subgraphs  of  H. 

If  any  changes  were  made  by  the  Van  Gelder  step 
Call  Algorithm  3  recursively  on  Vi. 

The  well-founded  partial  model  of  V  is  the  final  partial  interpretation  r. 

Proposition  7.  Consider  the  class  of  programs  V  consisting  of  strata  Pi, . . . 
where  each  stratum  is  either  Horn  or  positive- acyclic.  Algorithm  3  finds  the  well- 
founded  partial  model  of  any  such  P  in  time  0(|P|)- 

5  Improving  the  Quadratic  Bound  on  the  Worst  Case 

The  techniques  of  the  previous  section  often  speed  up  logic  program  evaluation, 
but  they  do  not  improve  on  the  worst-case  behavior  of  Van  Gelder ’s  algorithm. 
Moreover,  writing  an  IDE  for  testing  some  property  P  of  interest  which,  for  all 
interesting  EDE’s,  the  properties  of  Proposition  7  hold,  may  be  very  difficult  for 
some  properties  P.  We  would  like  a  far  more  general  speedi^  technique. 

Van  Gelder’s  algorithm  repeatedly  deletes  edges  from  H  and  rechecks  for 
accessibility  from  s.  Reif  [8]  has  studied  algorithms  for  updating  vertex  acces¬ 
sibility  information  in  directed  graphs  during  dynamic  edge  deletion.  His  work 


suggests  it  may  in  general  be  very  difficult  to  speed  up  in  the  worst  case  past  the 
size  of  the  graph  times  the  number  of  cycles  of  deletions.  Thus  his  result  seems  to 
suggest  that  generally  fast  well-founded  semantics  algorithms  might  be  difficult 
or  impossible  to  find,  even  for  logic  programs  with  ai  most  one  positive  subgoal 
per  rule.  This  same  lower  bound  applies  to  all  algorithms  for  the  well-founded 
semantics  which  we  have  previously  seen,  arousing  our  interest  in  the  question 
of  whether  that  quadratic  time  bound  can  be  beaten.  In  what  follows,  we  do 
improve  in  this  bound,  albeit,  as  previously  noted,  only  for  logic  programs  with 
at  most  two  positive  subgoals  per  rule  and  where  c^{\A\/\V\)  <  1  for  some  con¬ 
stant  c.  The  heart  of  the  difference  between  Reif’s  work  and  ours  is  that  we  are 
looking  for  a  fixed  point  of  an  operator  and  may  stop  as  soon  as  one  is  reached. 

^From  now  on  we  limit  attention  to  the  class  of  propositional  logic  programs 
V  over  atoms  A  with  at  most  2  positive  subgoals  per  clause. 

Observe  that,  with  Van  Gelder’s  algorithm,  it  takes  a  relatively  large  amount 
of  work  to  find  an  unfounded  set:  an  entire  depth-first  search  of  the  hypergraph 
U.  If  the  unfounded  set  is  large,  there  is  a  fairly  good  payoff  for  the  work  of 
searching,  but  if  the  unfounded  set  is  small,  there  has  been  a  high  cost  to  identify 
a  few  truth  values.  It  is  fairly  easy  to  construct  examples  where  all  the  algorithms 
of  the  previous  section  take  \A\  passes  to  discover  unfounded  sets,  each  of  small 
size.  Our  goal  here  is  to  find  somewhat  faster  ways  to  find  small  unfounded  sets. 

The  basic  idea  is  this;  Proceed  as  in  Algorithms  2  and  3.  Usually,  before  doing 
the  Van  Gelder  edge  addition  step  or  attempting  to  decompose  the  program  into 
strata  (modules)  (the  two  time-consuming  steps),  first  do  a  depth-first  search  of 
a  “small”  approximation  W  to  H,  an  approximation  chosen  so  that  any  atom 
inaccessible  in  W  is  also  inaccessible  in  H;  since  JV  is  smaller  than  ,  the  depth- 
first  search  will  proceed  faster.  If  any  inaccessible  nodes  are  found,  adjust  r  and 
hypergraph  %_  as  before  and  go  back  to  repeat  the  Fitting  computation,  having 
found  inaccessible  nodes  more  quickly  than  the  with  the  other  algorithms.  In 
general,  only  if  no  inaccessible  nodes  are  found  do  we  do  a  full  depth-first  search. 
If  no  inaccessible  nodes  are  found  by  the  full  depth-first  search  either,  then  the 
algorithm  is  finished,  with  just  an  additional  0(|7^|)  steps  used  in  checking  that 
we’re  done.  If,  on  the  other  hand,  some  inaccessible  nodes  are  found,  i.e.,  a  non¬ 
empty  unfounded  set  is  found,  we  shall  show  that  a  “large”  number  of  inaccessible 
nodes  must  be  found.  Thus  the  cost  of  the  depth-first  search,  averaged  over  all 
the  inaccessible  nodes  found,  is  not  too  high. 

If  ei,e2  are  hypergraph  edges  with  the  same  head,  and  if  every  tail  of  ei  is 
also  a  tail  of  63,  then  say  that  ei  is  a  subedge  of  62,  and  62  extends  ei. 

We  do  the  approximation  to  hypergraph  H  described  above  in  two  steps. 
We  start  with  a  subalgorithm  that  applies  only  to  hypergraphs  H  where  each 
subedge  (a,  {b})  occurs  in  <  jjl  edges  of  H  for  some  p.  It  will  use  a  parameter  T 
to  be  optimized  later. 


Algorithm  4.  Let  X  =  {a  E  A  :  a  is  the  head  of  >  T  edges  of  7i}. 

Let  W"  =  {(a,  {s})  :aEX}U{eEH:  head(e)  ^  X}. 

Do  a  depth-first  search  of  to  find  all  inaccessible  nodes. 

If  no  inaccessible  nodes  are  found, 

perform  a  depth-first  search  of  H  to  find  inaccessible  nodes. 

Lemma  8,  1,  If  a  vertex  is  inaccessible  in  ,  it  is  inaccessible  in  %. 

2.  If  an  inaccessible  node  is  found  in  the  first  search,  then  the  total  cost  of 
searching  is  0(T|^|). 

3.  The  total  cost  of  searching,  if  the  second  search  is  called,  is  0{\H\). 

4.  If  no  inaccessible  node  is  found  in  the  first  search  but  an  inaccessible  node 
is  found  in  the  second  one,  then  >  j  inaccessible  nodes  are  found.  Thus  the 

total  cost  of  searching  per  inaccessible  node  found  is 
Proof  1.  Obvious. 

2.  |W"|  G  0{T\A\)  since  there  are  at  most  T  edges  in  into  each  node. 

3.  Suppose  a  is  inaccessible  in  but  accessible  in  H.  Then  a  E  X,  so  there 
are  >  T  edges  with  head  a.  Thus  at  least  one  tail  on  each  of  these  edges 
must  be  inaccessible  in  H,  since  otherwise  a  would  be  accessible.  Suppose 
<  j  inaccessible  nodes  are  found.  Then,  by  the  pigeon-hole  principle,  one 
of  these  inaccessible  nodes  must  occur  on  >  T’/(^)  =  //  edges,  contradicting 
the  assumption  on  (jl. 

4.  Immediate  from  the  previous  part  since  the  cost  of  one  depth-first  search  is 

0(|W1).  ■ 

To  get  a  near-optimal  value  for  T  above,  equate  T\A\  and  {fz[H\)/T.  Solv¬ 
ing  for  T  we  get  =  fjL\7i\/\A\.  This  gives  an  average  cost  of  searching  (per 
inaccessible  vertex  found)  of  ^/fJ^[H\\Al 

If  in  the  original  hypergraph  H  the  value  of  (defined  just  before  Algo¬ 
rithm  4)  is  small,  the  total  cost  of  the  algorithm  is  good.  But  there  is,  in  gen¬ 
eral,  no  reason  to  expect  that  value  of  //  to  be  small.  So  we  perform  another 
approximation. 

Algorithms.  Let  7^  be  a  hypergraph,  and  let  B  be  a  number.  Let 

Hi  =  {(a,  {61})  :  for  >  B  vertices  62,  (a,  {h,h})  G  H} 

H2  =  {e  G  W  :  no  subedge  of  e  is  in  Hi} 

H^  ^Hi\JH2 

Call  Sublgorithm  4  on  H^  with  T  =  ^^/B\H\1\A\. 

If  no  inaccessible  nodes  are  found  above, 
perform  a  depth-first  search  of  H. 


Lemma  9.  Let  Hy  B,  and  be  as  in  Sublgoriihm  5. 

1.  If  some  vertex  a  E  A  is  inaccessible  from  s  in  li\  it  is  inaccessible  from  s 
in  H. 

2.  No  edge  (a,  {6})  is  a  subedge  of>B  edges  oflV  (i.e,  the  property  of  Algo- 
rithm  4  holds  with  —  B). 

3.  If  every  vertex  a  E  A  is  accessible  from  s  in  Ti^  but  some  vertex  a  E  A  is 
inaccessible  from  s  in  Ti,  then  >  B  vertices  of  A  are  inaccessible  in  Ti. 

i-  \'hL'\  ^  |W|-  Thus:  (i)  The  cost  to  perform  the  search  in  this  algorithm  is 
in  0{T\A\)  if  inaccessible  nodes  are  found  by  the  first  step  of  Algorithm  4^ 
(ii)  If  no  inaccessible  nodes  are  found  by  the  first  step  of  Algorithm  4  but 
some  inaccessible  nodes  are  found  by  the  second  step,  the  cost  to  perform  the 
search  is  in  0(|W'|)  c  0(|7^|).  (iii)  If  no  inaccessible  nodes  are  found  by 
Algorithm  4,  the  total  cost  to  perform  the  search  is  0(|7{|). 

5.  If  this  subalgorithm  performs  the  search  of  H  and  inaccessible  nodes  are 
found,  then  the  average  cost  of  searching  for  each  inaccessible  node  found  is 

oi\n\/B). 

Proof 

1.  Obvious. 

2.  The  subedges  in  Hi  have  no  tails  in  common,  and  no  edge  in  Hi  is  a  subedge 
of  any  edge  in  7^2-  If  any  B  edges  in  H2  had  a  subedge  in  common,  that 
subedge  would  have  been  put  into  Hi, 

3.  Suppose  a  is  accessible  in  W  but  not  in  H.  Then  there  must  be  an  edge 
(a,  {6})  EH'  —  H  where  b  is  accessible  in  H  but  a  is  not.  By  definition  of 
H' ,  there  must  be  >  B  edges  {cl^{b,b2])  G  H,  Since  a  is  inaccessible  in  H, 
none  of  these  >  B  62’s  can  be  accessible  in  H  either. 

4.  That  \H'\  <  \H\  is  obvious.  Statements  (i)  and  (ii)  then  follow  from  Lemma  8. 
Statement  (iii)  follows  from  the  fact  that  the  searching  cost  is  dominated  by 
the  cost  of  the  search  of  H*  followed  by  the  search  of  H. 

5.  Follows  from  the  two  previous  steps.  I 

A  near-optimal  value  for  B  can  be  computed  by  equating  the  two  previous 
average  costs,  and  This  gives  B  =  yielding  a  final 

average  cost  of  The  total  search  cost  must  be  at  most  |^| 

times  this  —  thus  The  speedup  is  on  the  order  of  ^/fP\/\Al 

as  stated  earlier. 


We  put  our  modifications  together  into  the  following  algorithm.  In  order  to 
simplify  the  bookkeeping,  we  divide  the  program  into  strata  only  once,  after  the 
initial  application  of  Fitting's  algorithm. 

Algorithm  6. 

Initialize  r  =  0.  Initialize  Ji  and  %  as  usual. _ 

Perform  the  Fitting  step,  updating  r.li  and  H. 

Decompose  A  into  its  strongly  connected  components  ^i, . .  .,-4j  under  <dep 
(defined  from  Ti). 

Topologically  sort  the  components,  >li ,  ^2,  •  •  •  j  • 

Construct  the  subprograms  'Pi ,  “^2,  •  •  •  ?  'Pn • 

For  z  =  1  to  n: 

Construct  the  subhypergraphs  'Hi.Tii  and  construct  Tii  (as  in  Algorithm  5). 
Repeat  until  no  inaccessible  edges  are  found  in  the  next  step: 

Call  Algorithm  5  on  Tii, 

If  any  inaccessible  vertices  are  found. 

Update  T  and  all  's,  Tlj 's,  and  ’s  for  j  >  _ 

Perform  the  Fitting  step,  updating  r  and  all  Wj’Sj  Tij^s,  Wj's  for  j  >  i. 

The  well-founded  partial  model  of  P  is  the  final  partial  interpretation  r . 

It  turns  out  that  all  the  bookkeeping  and  graph  construction  and  update  for 
the  above  algorithm  can  be  done  in  time  |P|  +  \A\^ y  as  we  shall  comment  upon 
in  the  next  section.  Combining  this  with  the  preceding  remarks,  we  have: 

Theorem  10.  Algorithm  6  computes  the  well-founded  semantics  for  any  propo¬ 
sitional  logic  program  |P|  in  proposition  letters  A  with  <  2  positive  subgoals  per 
clause  in  time  0(|P|  +  [Ap  +  |A|3  |P|3). 

Since  we  expect  that  in  common  examples  >  IPI  + 

we  have  simplified  the  result  above  to  saying  that  generally  our  speed-up  over 
the  straight  Van  Gelder  algorithm  is  Obviously,  if  E 

0(|P|),  the  algorithm  performs  in  time  0(P),  which  is  best  possible. 

In  practice,  it  would  seem  wise  to  optimize  the  algorithm  by,  approximately 
once  every  ^jp|/|A|  iterations  of  the  outer^^epeat  loop,  replacing  the  call  to 
Algorithm  5  with  a  full  depth-first  search  of  Tii.  Except  for  P  much  larger  than 
Ay  this  would  not  hurt  the  worst-case  performance  of  the  algorithm,  and  it  might 
help  if  the  algorithm  is  finding  many  small  unfounded  sets^when  finding  some 
large  unfounded  set  would  substantially  reduce  the  size  of  H, 


Implementation 

What  we  have  left  to  do  is  to  show  that  all  the  necessary  bookkeeping  for  the 
algorithm  can  be  done  in  time  0(|P|).  The  basic  work  is  in  choosing  the  appro¬ 
priate  data  structures  and  the  right  counts  to  store  and  in  doing  enough  work 


in  preprocessing,  leaving  mostly  updating  of  the  data  structures  and  the  counts 
for  run  time  (plus,  of  course,  many  depth-first  searches).  All  this  can  be  done; 
we  just  sketch  the  construction  below. 

For  each  a  G  -4  we  store  3  doubly-linked  lists  of  pointers:  pointers  to  hyper- 
edges  with  head  a,  hyperedges  with  a  in  the  tail,  and  hyperedges  with  in 
the  presupposition.  (In  fact,  we  have  to  store  the  presuppositions  themselves  as 
doubly  linked  lists,  simplifying  according  to  the  usual  logic  rules  each  time  an 
atom  is  first  set  to  T  of  F.)  We  store  each  'Hiy  and  7i^  and  update  them 
dynamically;  thus  we  actually  need  one  set  of  the  3-doubly  linked  lists  of  point¬ 
ers  above  for  each  of  the  three  graphs.  For  this  we  need  a  pointer  from  each 
edge  in  Hi  to  its  subedge  (if  any)  in  a  counter  on  the  subedge  telling  how 
many  edges  point  to  it  (so  we  can  tell  when  to  remove  the  subedge  from  H  and 
reinstall  the  original  edges),  and  doubly-linked  lists  going  from  the  subedge  the 
edges  from  which  it  is  derived  to  allow  us  to  reinstall  the  original  edges  quickly. 
(Importantly,  each  reinstallation  is  done  only  once.)  We  store  and  dynamically 
update  an  |^|-length  array  giving  the  number  of  rules  of  with  head  each  vari¬ 
able  a;  this  allows  us  to  create  the  approximations  Hi  of  Algorithm  4  quickly. 
In  the  construction  we  use  an  1^|  x  |.A|  array  giving  the  number  of  times  each 
atom  appears  as  a  positive  subgoal  in  a  rule  with  head  each  a'  G 

6  Future  Work 

There  are  two  clear  open  directions  left  by  this  work,  both  of  which  we  intend 
to  pursue.  One  is  to  relax  the  restriction  to  two  positive  subgoals  per  rule.  We 
expect  this  can  be  done,  but  it  is  not  clear  how  high  the  cost  is  going  to  be; 
we  do  not  expect  to  be  able  to  achieve  the  same  speedup  we  have  here.  (By 
analogy,  a  better  speedup  can  be  achieved  if  we  restrict  attention  to  programs 
with  only  one  positive  literal  per  clause.)  The  other  direction,  as  noted  before,  is 
to  integrate  an  algorithm  such  as  ours  with  actual  deductive-database  inference 
techniques. 
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